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Nicotinamide phosphoribosyltransferase (NAMPT) is a rate-limiting enzyme in the salvage pathway of 
nicotinamide adenine dinucleotide biosynthesis. NAMPT protein is a secreted plasma biomarker in 
inflammation and in cancer. The NAMPT enzymatic inhibitor, FK866, acts as an inducer of apoptosis and is 
a cancer therapeutic candidate, however, little is known regarding the influence of NAMPT on cancer 
biological mechanisms or on the prognosis of human cancers. We interrogated known microarray data sets 
to define NAMPTknockdown-influenced gene expression to demonstrate that reduced NAMPT expression 
strongly dysregulates cancer biology signaling pathways. Comparisons of gene expression datasets of four 
cancer types generated a N39 molecular signature exhibiting consistent dysregulated expression in multiple 
cancer tissues. The N39 signature provides a significant and independent prognostic tool of human 
recurrence-free survival in lung and breast cancers. Despite the absence of clear elucidation of molecular 
mechanisms, this study validates NAMPT as a novel "oncogene" with a central role in carcinogenesis. 
Furthermore, the N39 signature provides a potentially useful tool for prediction of recurrence-free survival 
in lung and breast cancer and validates NAMPT as a novel and effective therapeutic target in cancer. 



Nicotinamide phosphoribosyltransferase (NAMPT), also known as pre-B ceil colony-enhancing factor 1, 
encodes a protein that catalyzes the condensation of nicotinamide with 5-phosphoribosyl-l- 
pyrophosphate to yield nicotinamide mononucleotide, and is the rate-limiting enzyme in the salvage 
pathway of nicotinamide adenine dinucleotide (NAD) synthesis'. This multifunctional enzyme was first cloned 
from human lymphocytes, and named pre-B cell colony- enhancing factor as a secreted cytokine^. NAMPT was 
further confirmed as a pro-inflammatory cytokine' that inhibits neutrophil apoptosis"* and exerts endotoxin-like 
responses to trigger NFkB signaling pathways^. Given the dual intracellular enzymatic activity {iNAMPT) and 
extracellular proinflammatory cytokine characteristics (eNAMPT), NAMPT has been implicated in many 
important biological processes, including metabolism, stress response, apoptosis and aging', the majority of 
which are closely related to carcinogenesis signaling. 

Secreted NAMPT or eNAMPT is elevated in plasma in a variety of human cancer types including gastric, 
endometrial, hepatocellular, colorectal, and breast cancers'", indicating circulating eNAMPT protein as a potential 
plasma biomarker or prognosis factor of cancer'. Although the molecular mechanism of eNAMPT on carcino- 
genesis is unknown, the inflammatory response of eNAMPT on tissues promotes tumor proliferation and redox 
adaptive responses". Moreover, as an enzyme, intracellular NAMPT or iNAMPTis responsible for regeneration of 
intracellular NAD*, which is a multifunctional co-factor in many ceflular events, such as transcriptional regu- 
lation, longevity and caloric-restriction responses, ceU cycle progression, apoptosis, DNA repair, circadian 
rhythms, chromatin dynamics regulation, and telomerase activity, processes closely related to cancer pathogen- 
esis'". Thus, inhibition of NAMPT enzymatic activity by the highly specific, noncompetitive inhibitor, FK866, 
induced apoptosis in murine neutrophils' and human liver carcinoma cells'". Although potentially mediated by 
NAD* depletion, the clear mechanism of FK866-mediated apoptosis was not characterized. Additional studies 
determined that FK-866 produced premature senescence, possibly linked to decreased activity of the NAD*- 
dependent enzyme, SIRTl". Similarly, little is known of the molecular mechanisms by which iNAMPT may 
contribute to oncogenesis or influence human cancer prognosis. 
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We conducted meta-analysis of genome-wide expression data to 
identify NAMPT-influenced genes implicated in cancer pathobiol- 
ogy. First, we identified differentially expressed genes utilizing 
microarray data from two independent human cell lines (primary 
and cancer cells) and wild-type (WT) cells and NAMPT knock down 
(KD) cells. These differentially-expressed genes were denoted as 
NAMPT-influenced genes with gene ontology analysis indicating 
enriched cancer- related pathways. Second, a prognostic gene sig- 
nature derived from the NAMPT-influenced genes was developed 
and expression was compared in normal and colon, lung, pancreatic, 
and thyroid cancers. Thirty-nine NAMPT-influenced genes were 
identified as being commonly differentially expressed in tumor tis- 
sues and comprised a multi- molecular cancer outcome predictor. 
Our studies indicate this molecular signature effectively predicts 
recurrence-free survival in lung and breast cancer in a manner inde- 
pendent of standard clinical and pathological prognostic factors. 

Results 

NAMPT-influenced genes. We compared the gene expression pat- 
tern between wild type and NAMPT-silenced human endothelial cell 
and a breast cancer cell line to identify genes potentially regulated by 
NAMPT. Two independent microarray datasets containing gene 
expression information for both wild type and NAMPT-sUenced 
cells were collected from the Gene Expression Omnibus (GEO) 
database'^: one dataset was derived from a MCF-7 breast cancer 
cell line (GSE 13449)" and the second dataset was from human 
pulmonary microvascular endothelial cells (GSE34512)'''. The 
genes differentially expressed between WT and NAMPT silenced 
cells in both datasets with accordant direction were retained as 
NAMPT-influenced genes. At the specified significance level of 
false discovery rate (FDR) < 5% and fold change >1.1 (see 
Methods for details), 462 genes were found to be commonly 
differentially expressed between WT and NAMPT silenced cells, 
among which 361 genes were up-regulated while 101 genes were 
down-regulated in NAMPT silenced cells (Supplementary Table 
SI). We next searched the enriched Kyoto Encyclopedia of Genes 
and Genomes (KEGG)'"" physiological pathways among the 
dysregulated genes revealing genes enriched in cancer-related 
KEGG terms, such as "Pathways in cancer", "Colorectal cancer", 
"Melanogenesis", "Renal cell carcinoma", and "Apoptosis" 
(Figure 1, Fisher's exact test). These findings suggested that the 
NAMPT-influenced genes are involved in human cancer pathology. 

To determine the depth of involvement of NAMPT-influenced 
genes in human cancers, we explored expression differences of these 
genes between normal and tumor tissues from lung (GSE18842)"', 
colon (GSE23878)", pancreatic (GSE15471)'^ and thyroid 
(GSE33630) cancers. Paired normal and tumor tissues from 44 lung, 
19 colon, 36 pancreatic, and 44 thyroid cancer patients were 
included. Paired t-test was used to detect the differentially-expressed 
genes between the normal and tumor tissues. In total, 39 genes were 
identified as mutually differentially expressed and concordant in 
expression with the NAMPT-silenced model (P < 0.05 after 
Benjamini-Hochberg adjustment) in at least three out of four cancer 
types: lung cancer (Fig. 2), colon cancer (Supplementary Fig. SI), 
pancreatic cancer (Supplementary Fig. S2), and thyroid cancer 
(Supplementary Fig. S3). We designated this NAMPT-influenced 
39-gene set as the N39 gene signature (Table 1). 

N39 predicts recurrence-free survival in lung and breast cancers. 

We hypothesized that the N39 signature would be predictive of 
tumor outcome in lung and breast cancer patients. We constructed 
a scoring system to assign each patient a risk score, representing a 
linear combination of the N39 gene expression values weighted by 
the coefficients obtained from training cohorts (GSE8894" for lung 
cancer and GSE2034^° for breast cancer) (see Methods for details). 
N39-positive patients were defined as those having risk scores greater 
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Figure 1 | The top 20 KEGG pathway terms associated with the NAMPT- 
influenced genes. The P-values were calculated by Fisher's exact test. The 
red dash line denotes the significance level of ot = 0.05. 

than the group median. As expected, there was a significandy 
reduced recurrence-free survival for N39-positive patients in the 
training cohorts (Supplementary Figure S4 and Table 2). 

We tested the ability of the N39 based risk score to classify patients 
into prognostic groups in independent validation cohorts. For 
each cancer type, two validation cohorts were collected: Lungl 
(GSE31210)^' and Lung2 (GSE37745)^^ for lung cancer, and 
Breastl (GSE25066)" and Breast2 (GSE21653)^'' for breast cancer. 
Kaplan-Meier survival curves demonstrated a significantly reduced 
recurrence-free survival for N39-positive patients in the validation 
cohorts (log-rank test: P = 5.4 X 10"= for Lungl; P = 0.011 for 
Lung2; P = 2.9 X lO"' for Breastl; and P = 7.2 X lO"" for 
Breast2) (Figure 3). Univariate Cox proportional hazards regression 
indicated that N39-positive patients exhibited significantly increased 
risk for recurrence (fold increase or FI) in these 4 cohorts: 2.88-FI for 
Lungl, 2.08-FI for Lung2, 2.27-FI for Breastl, and 2.12-FI for Breast2 
(Table 2). These findings collectively indicate that N39 is predictive 
of recurrence-free survival in lung and breast cancer. 

In a recent computational study, 47 published breast cancer pro- 
gnostic signatures were compared with signatures comprised of ran- 
domly selected genes. Approximately 60% of the published 
signatures were not significantly improved over random signatures 
of identical size with the majority of random gene signatures signifi- 
cantly associated with breast cancer outcome^^. We performed a 
resampling test to determine whether the prognostic power of N39 
was significantly better than random gene signatures. We con- 
structed 1,000 random gene signatures of identical size as N39 (39 
genes) with Cox proportional hazards regression of survival con- 
ducted for each resampled gene signature. The association between 
each random gene signature and recurrence- free survival was mea- 
sured by the Wald statistic, the ratio of Cox regression coefficient to 
its standard error. Our alternative hypothesis was that the Wald 
statistic value of N39 should be higher than that of randomized gene 
signatures if N39 was more predictive than randomized signatures. 
Figure 4 indicates that the Wald statistic of N39 was significandy 
higher than that of randomized gene signatures (Right-tailed: P = 
0.026 for Lungl; P = 0.020 for Lung2; P = 0.009 for Breastl; and P = 
0.011 for Breast2), suggesting that the nuU hypothesis that the asso- 
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Figure 2 | Comparison of N39 gene expression between normal and lung cancer tissues. Paired normal and tumor tissues from 44 lung cancer patients 
were included in the comparison. All the listed genes are differentially expressed between normal and tumor tissues except ZMIZ2. Y-axis: 
log2-transformed expression values. 



elation between N39 and recurrence-free survival is by chance is 
rejected. 

N39 is independent of standard clinical and pathological 
prognostic factors. We investigated the performance of the N39 
signature in comparison with standard clinical and pathological 
factors associated with prognosis in human cancers. For the Lungl 
cohort, we considered factors including patient age, gender, smoking 
history, stage, EGPR/KRAS/ALK gene alteration status, and MYC 
protein levels. For the Lung2 cohort, we considered age, gender, 
and stage into account. For the Breastl cohort, we included factors 
such as age, lymph node status, histological grade, tumor size, 
estrogen receptor (ER) status, and progesterone receptor (PR) 
status. For the Breast2 cohort, factors such as age, grade, ER and 
PR status, and TP53 gene alteration status were included in the 
multivariate model. A multivariate Cox proportional hazards 
regression of survival indicated that N39 status remained a 
significant covariate in relation to the clinico-pathological factors 
in each vaUdation cohorts (P = 1.2 X 10"^ for Lungl; P = 6.4 X 
10"' for Lung2; P = 2.9 X 10"' for Breastl; and P = 1.7 X 10"^ for 
Breast2) (Table 3). 

In the Lung2 and Breast2 cohorts, N39 status was the only signifi- 
cant covariate in the multivariate model (Table 3). However, in the 
Lungl cohort, patient age, stage, and EGPR/KRAS/ALK alteration 
status were also significant variables. Therefore, we further stratified 
the patients in the Lungl cohort according to respective significant 
factors and redid Cox proportional hazards regression. For patients 
aged < 60 and s60, N39-positive patients had significant increased 
risk for recurrence, 2.62-FI (P = 0.038) and 2.57-FI (P = 0.005), 
respectively. For patients with stage I cancer (Lungl cohort only 



includes patients with stage I and II lung cancer), N39-positive 
patients exhibited significantly increased risk for recurrence (2.48- 
FI, P = 0.012), however, no significant difference was observed 
between N39-positive and -negative groups for patients with stage 
II lung cancer. For patients without and with EGPR/KRAS/ALK 
alteration, N39-positive patients had a 2.35-FI (P = 0.041) and 
2.36-FI (P = 0.015) increased risk for recurrence, respectively. We 
also checked the performance of the N39 signature in patients with- 
out and with smoking history respectively and found N39-positive 
patients exhibited increased risk for recurrence in never-smokers 
and ever-smokers, 2.72-FI (P = 0.012) and 2.19-FI (P = 0.034) 
respectively. Kaplan-Meier survival curves demonstrated signifi- 
cantly reduced survival for N39-positive patients in each subset 
grouped by age, stage, EGPR/KRAS/ALK alteration status, and smok- 
ing history, with the exception of patients with stage II lung cancer 
(Fig. 5A), presumably reflecting the reduced sample size. 

In the Breastl cohort, lymph node status, tumor size, and ER status 
were significant clinicopathological factors in addition to N39 status 
(Table 3). We stratified patients in the Breastl cohort according to 
these factors. For patients with and without lymph node involve- 
ment, N39-positive patients exhibited significantly increased risk 
for recurrence, 8.03-FI (P = 0.006) and 2.09-FI (P = 6.1 X 10"*), 
respectively. For patients with tumor size <T3 and sT3, N39- 
positive patients displayed significant increased risk for recurrence, 
2.56-FI (P = 0.002) and 1.69-FI (P = 0.044), respectively. For 
patients with ER negative status, N39-positive patients had a mar- 
ginally increased risk for recurrence (1.59-FI, P = 0.057), while for 
the ER positive group, N39-positive patients exhibited significantly 
increased risk for recurrence, 2.7-FI (P = 0.004). Breast cancer is 
strongly related to age with —80% of breast cancer occurring in 
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Table 1 N39 gene set 


Gene symbol 


Gene title 




Qusnosins kinoss 


AP2B1 


<J(J<jpi(Jl 1 clU lc(J UiUlcul LUlllUIcA Z., UclU 1 oUUUIIII 


AVL9 


/AVL7 MUIIIUIUU lO. CfcJl cVlbl (Joel 
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colnsxin 
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dihydrolipoomids bronchsd choin tronsocylos© E2 


DHRS7 


dshydroQsriQss/rsductQS© (SDR fomilyj msmbsr 7 




downstrsom nsiQhbor of SON 


FAM 1 90S 


tnmi V with ^pnii^nr'^ ^imi nritv 1 VO mpmnpr R 


FGFRl 


IIUIUUILlbl yiUWIII lULKJI IcLcUllJI 1 


FOXN3 


1 (Jl ICVJVJ \J\JA, I NO 
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nn m mri-n 1 itn mv hvriro n ipon ii inn^p 
^vjiiiiiiu ^luivjiiiyi iiyuiwivjoc i^^iiiu^vjoc. 




folylpolygammaglutamyl hydrolase] 
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in^ii in- Mkp nrowtn frir'tnr ninHinn nrotpin t 

1 1 1 0 U 1 1 M II IvC 1 \J VV III lu^lwl kJIIIUIIIU kJIwIdM 




intsrssctin 2 




laminin, Qamma 1 (formsrly LA^AB2) 
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leukemia inhibitory factor receptor alpha 
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mptn\/ltr/~in crpr/icp IiI^p / A 
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IIIC?l\JII^II ll^l IC^II 1 1 /\ 
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mptn ntninnpin xA 

1 1 IC^IVJII^II ll^l IC^II 1 /V 


NAB/ 


NGFI-A binding protein 1 (EGRl bindinc] protein 1) 




nuclear receptor coactivator 1 




nuclear receptor corepressor 1 


PAPOL4 


poly(A) polymerase alpha 


PPMEl 


piUlcNI pilCJ5Ull(JI(Jbc 1 1 IcI 1 1 y Icblcl Doc 1 


ppp ipni 

rrr 1 1\ i ^jl 


piUlclM pil(J5Ull(JI(J5c 1 , 1 cUUlU l(JI y oUUUIMI 1 O Mls.c 


PRKAR2A 


nrntpin Icinn^p r"AK^P-npnpnnpnt rpmi ntnrv tvnp 

LJIuldM IvMI^JjC!?^ \-,r\l V I J ^JC? LJC^I IVJI^IIIy IC'LJUIUI^iy^ \ \ J 
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rnnnntin RAn (^TPn^p ninninn pffpptnr nrntpin 1 

1 kJVJ 1 1 1 1 ^ l\/^L' \^ II U 0~ kJMIVJIJIVJ dl C?v> \\J\ kjiwldll 1 


RBBP8 


retinoblastoma binding protein 8 


SGPLl 


oUi 1 1 1 lUUol 1 Ic 1 pi ItJbpi l(J Ic lyLioc 1 


SIRTl 


sirtuin 1 


SNX2 


sorting nexin 2 


SREKl 


splicing regulatory glutamine/lysine-rich protein 1 


TAFIB 


TATA box binding protein (TBP)-associated factor, RNA 




polymerase 1, B, 63 kDa 


TMED5 


transmembrane emp24 protein transport domain 




containing 5 


ZMIZ2 


zinc finger, MIZ-type containing 2 



women age >50. We demonstrated that N39-positive women age 
<50 exhibit a 1.9-FI (P = 0.020) whereas women age >50 exhibit a 
2.64-FI increased risk for recurrence (P = 8.4 X 10"''). Kaplan-Meier 
survival curves confirmed a significantly reduced survival for N39- 
positive patients in each subset grouped by age, lymph node status, 
tumor size, and ER status (Figure 5B). 

Discussion 

NAMPT is a novel cancer marker'' and therapeutic target^*" with 
unclear mechanisms of action. Regardless of intrinsic complex bio- 



logically function with differential roles as secreted proinflammatory 
cytokine (extracellular NAMPT) or rate-limiting NAD* synthesis 
enzyme (intracellular NAMPT), we looked into the prognostic power 
with the gene sets regulated by NAMPT. Firstly we confirmed the 
critical role of NAMPT in carcinogenesis by the gene ontology ana- 
lysis of all NAMPT-mediated genes: eight of the eleven significantly 
deregulated pathways are direct cancer pathways (Fig. 1). Secondly, 
we generated the N39 signature by filtering through gene express 
data sets of four cancer types. Thirdly, we validated N39 signature as 
a powerful tool to provide important prognostic lung and breast 
cancer and determined the N39 gene signature as a significant and 
independent predictor of cancer recurrence-free survival. 

We chose lung and breast cancers to serve as the validation study 
for cancer survival prognosis, mainly dependent on the availability of 
the datasets (three independent studies to serve as one discovery 
cohort and two validation cohorts). Moreover, this choice of cancer 
type selection is based on the severity of the two types of cancer. Lung 
cancer is the most frequently diagnosed cancers and leading cause of 
cancer death in males, comprising 17% of the total new cancer cases 
and 23% of the total cancer deaths^'. In females, breast cancer is the 
most frequently diagnosed cancer and the leading cause of cancer 
death, accounting for 23% of the total cancer cases and 14% of the 
cancer deaths^'. 

Prognostic molecular signatures that work cooperatively with tra- 
ditional clinical and pathological factors may increase prognostic 
accuracy when identifying patients at higher risk for recurrence 
and death^""^". Our proposed molecular signature that is composed 
of 39 NAMPT-mediated genes is a promising prognostic marker, 
because N39 was solely developed based on the discovery cohort 
and its prognostic power was validated in two independent valid- 
ation cohorts for lung and breast cancer, respectively. More impor- 
tantly, N39 was independent of other clinicopathological covariates. 
In the Lungl cohort, when grouped by age, £GPP/KZMS/ALK altera- 
tion status, and smoking history, N39 further stratified lung cancer 
patients with significant differences in survival. A significantly 
increased risk of recurrence was also observed in N39-positive 
patients of stage I. However, we failed to observe a significant differ- 
ence between N39-positive and -negative groups among the patients 
of stage II, which may be due to the relatively smaller sample size in 
this category. To validate the prognostic power of N39 in stage II 
tumor, we further included an additional lung cancer dataset 
(GSE41271)^' here. We merged the subjects of stage II from three 
independent cohorts (Lungl, Lung2, and GSE41271) using the 
"metaArray" package in Bioconductor (see Supplementary Fig. S5 
for details). We found that N39-positive patients (with stage II 
tumor) exhibited a significantly increased risk (1.68-FI, P = 0.049 
by univariate Cox proportional hazards regression) for recurrence 
comparing with N39-negative patients. Also, Kaplan-Meier survival 
curves confirmed a significantly reduced survival (P = 0.047 by log- 
rank test) for N39-positive patients of stage II (Supplementary Fig. 
S5). In the Breast 1 cohort, we stratified the patients according to age, 
lymph node status, tumor size, and ER status, respectively. A signifi- 
cantly increased risk of recurrence was also observed in N39-positive 
patients in each category, except for the marginal signal in ER nega- 



Table 2 | Cox proportional hazards regression of survival by N39 status in lung and breast cancers 

Training cohort Validation cohort 

Cancer n HR 95% CI P-value Cohort n HR 95% CI P-value 



Lung 


138 3.18 


1.91,5.29) 


8.4 X 10^-^ 


Lungl 


226 


2.88 


(1.69,4.95) 


1.2 X lO"-* 










Lung2 


96 


2.08 


(1.17,3.70) 


1.3 X 10-^ 


Breast 


286 2.76 


1.83,4.14) 


1.1x10-* 


Breast] 


508 


2.27 


(1.53, 3.37) 


4.8 X 10-= 










Breast2 


252 


2.12 


(1.36, 3.30) 


9.5 X 10-" 


Note - n: sample size 


HR: hazard ratio; CI: confidence 


interval. 
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Figure 3 | Kaplan-Meier curves for patients in validation cohorts. The expression of N39 predicts poor recurrence-free survival in lung (Lungl and 
Lung2 cohorts) and breast (Breasti and Breast2 cohorts) cancers. Red curves are for N39-positive patients while blue curves are for N39-negative patients. 
N39-positive patients were defined as those having a risk score greater than the group median. P- values were calculated by log-rank tests for the differences 
in survival between N39-positive and -negative groups. 
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Figure 4 | Non-random prognostic power of N39 in lung and breast cancers. Z denotes the Wald statistic, the ratio of Cox regression coefficient 
to its standard error. The black triangles stand for the 2 values of N39. The grey areas show the distributions of Z values for the 1,000 resampled gene 
signatures with identical size as N39 under the nuU hypothesis of no association between N39 and recurrence-free survival. Right-tailed P-values of the 
sampling distribution were calculated. 



Table 3 | Multivariate Cox proportional hazards regression of survival in the validation cohorts 

Cohort Covariate HR 95% CI P-value 



Lungl 


N39 + vs. - 


2.51 


(1 .44, 4.37) 


1.2 X 


10-3 




Age (per year) 


1.04 


(1.00, 1.08) 


2.9 X 


10-2 




Gender male vs. female 


0.70 


(0.35, 1.41) 


3.2 X 


10-' 




Smoking -1- vs. — 


1.40 


(0.72, 2.75) 


3.2 X 


10-' 




Stage 


2.86 


(1.68,4.85) 


9.8 X 


10-^ 




£GF/?/KRAS/Ai/f alteration + vs. - 


0.57 


(0.34, 0.96) 


3.6 X 


10-2 




MYC level high vs. low 


0.90 


(0.35, 2.32) 


8.3 X 


10-' 


Lung2 


N39 + vs. - 


2.32 


(1.27,4.25) 


6.4 X 


10-3 


Age (per year) 


1.00 


(0.97, 1 .04) 


8.0 X 


10-' 




Gender male vs. female 


0.85 


(0.47, 1 .54) 


5.9 X 


10-' 




Stage 


1.44 


(0.99, 2.09) 


5.5 X 


10-2 


Breasti 


N39 + vs. - 


1.97 


(1.26, 3.07) 


2.9 X 


10-3 




Age (per year) 


1.00 


(0.98, 1.02) 


9.6 X 


10-' 




Lymph node -1- vs. — 


2.88 


(1.66,5.00) 


1.8 X 


10-" 




Grade 3 vs. 1 ,2 


0.73 


(0.45, 1.18) 


2.0 X 


10-' 




Tumor size >T3 vs. < T3 


1.65 


(1. 11, 2.46) 


1.3 X 


10-2 




ER + vs. - 


0.52 


(0.30, 0.90) 


2.0 X 


10-2 




PR + vs. - 


0.66 


(0.39, 1.14) 


1.4 X 


10-' 


Breast2 


N39 + vs. - 


1.97 


(1.13, 3.43) 


1.7x 


10-2 




Age (per year) 


1.00 


(0.98, 1.02) 


8.5 X 


10-' 




Grade 3 vs. 1 ,2 


0.92 


(0.52, 1.63) 


7.7 X 


10-' 




ER + vs. - 


0.70 


(0.26, 1.86) 


4.7 X 


10-' 




PR + vs. - 


1.35 


(0.52, 3.50) 


5.4 X 


10-' 




7P53 alteration + vs. — 
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Note - HR: hazard ratio; CI: confidence interval. 
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Figure 5 | Kaplan-Meier curves of patient cohorts grouped by clinical and pathological factors. (A) N39 is independent of traditional 
clinicopathological factors in lung cancer. Patients are stratified by age, stage, EGFR/KRASMIiC alteration status, and smoking history. (B) N39 is 
independent of traditional clinicopathological factors in breast cancer. Patients are stratified by age, lymph node status, tumor size, and ER status. Red 
curves are for N39-positive patients while blue curves are for N39-negative patients. N39-positive patients were defined as those having a risk score greater 
than the group median. P-values were calculated by log-rank tests for the differences in survival between N39-positive and -negative groups. 



five patients. Taken together, these results confirm that N39 is not 
dependent on specific values of the respective covariates status, 
which enhances the identification of cancer patients at greater risk 
for recurrence. 

We used the median of N39 risk score as a cutoff to stratify patents 
into two groups (N39-positive and -negative) to conduct categorized 
statistical analyses (such as Kaplan- Meier analysis and log-rank test). 



Clinically, zero can be utilized as an absolute cutoff to divide patients 
into high- and low-risk groups, as the median of N39 score is appro- 
ximately equal to zero in each validation cohort (Supplementary 
Fig. S6). 

In addition to its prognosis utility, N39 gene list also provides a set 
of NAMPT associated genes that might play critical roles in cancer 
pathogenesis. One good example is SIRTl. NAMPT-SIRTl-MYC 
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axis critically regulates cell survivaF^. SIRTl is also found over- 
expressed in many cancers and frequently NAMPT is concurrently 
over-expressed with SIRTl ^ which is important for prostate cancer 
cell survival and stress response^^^. A recent study in pancreatic cancer 
lines, however, suggested that NADase CD38 but not SIRTl is crucial 
for pancreatic cancer cells' response to iVAMPT inhibition^^^, suggest- 
ing the complex interaction of NAMPT with SIRTL These previous 
findings, together with N39 signature, have generated novel biomar- 
kers or therapeutic targets in cancer. 

This study is an example of re- examination of available genomic/ 
genetic data in the "big data" era, with a novel translational approach. 
NAMPT is confirmed to be a novel "oncogene" with a central role in 
carcinogenesis despite a clear molecular mechanism elucidated. In 
addition to cancer prognosis, now well validated in the current study, 
N39 has promise in the management of multiple cancers. 

Methods 

Expression microarray data. We obtained the gene expression data of WT and 
NAMPT KD MCF-7 breast cancer cells (GSE13449)^' and of WT and NAMPT KD 
pulmonary microvascular endothelial cells (GSE34512)^^ from the NCBI GEO 
database'^. The gene expression data of paired normal and tumor tissues for lung 
(GSE18842)'^ colon (GSE23878)^^ pancreatic (GSE15471)^^ and thyroid 
(GSE33630) cancers were also collected from the GEO database. Training and 
validation cohorts were constructed for lung and breast cancers. From the GEO 
database, we collected the expression datasets with available information on 
recurrence- free survival for lung (GSE8894^^ for training and GSE31210^' and 
GSE37745'' for validation) and breast (GSE2034''' for training and GSE25066'^ and 
GSE21653^'* for validation) cancers. 

Microarray data processing. The GC robust multichip average algorithm^^ was used 
to summarize the expression level of each probe set for the microarray data of WT and 
NAMPT KD human cells and of paired normal and tumor tissues. Only the probe sets 
present (determined by function "masScalls" in the Bioconductor "aff)'" package^^) in 
at least two thirds of the samples were retained. We further limited our analysis to the 
probe sets with unique annotations and removed genes on chromosomes X and Y to 
avoid potential confounding factors. Significance analysis of micro arrays^^, 
implemented in the samr library of the R Statistical Package, was used to compare 
log2- transformed gene expression levels between WT and NAMPT KD human cells. 
FDR was controlled using the q- value method'^-^^. Transcripts with a fold- change 
greater than 1.1 and FDR less than 0.05 were deemed differentially expressed. 

Risk scoring system. For each training cohort, univariate Cox proportional hazards 
regression was used to evaluate the association between recurrence-free survival and 
gene expression. A risk score was then calculated for each patient using a linear 
combination of gene expression weighted by the Wald statistic (ratio of regression 
coefficient to its standard error) as shown below: 

s=^Zi(.,-,,,)A, 



In the equation above, S is the risk score of patient; n is the number of genes; Z, 
denotes the Wald statistic of gene /; e, denotes the expression level of gene /; and and 
T, are the mean and standard deviation of the gene expression values for gene / across 
all samples, respectively. Patients were then divided into positive and negative groups 
with the median of the risk score as the threshold. A higher risk score implies a poor 
outcome. The scoring system and the associated scaling coefficients were fixed based 
on the training cohorts and then evaluated in the validation cohorts. All statistical 
analyses were conducted using the R platform (version 2.15.1). The a level for all the 
statistical tests was 0.05. 
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